Fuzzy clustering for indexing in the GAMBAL information retrieval system
نویسندگان
چکیده
Gambal is an information retrieval system for indexing and accessing web pages that includes graphical interfaces to ease web page search and accessing. In particular, the interfaces provide the user with tools for navigating through hierarchies of documents and visualize selected documents and similar ones. Here, similarity is either based on Wordnet 1.7 or Latent Semantics Analysis. Graphical interfaces include both Hierarchical Spherical Clustering (HSC) and Hierarchical Self Organizing Maps (HSOM). In this work we introduce the use of fuzzy clustering for indexing in the HSC interface.
منابع مشابه
Exploration of textual document archives using a fuzzy hierarchical clustering algorithm in the GAMBAL system
The Internet, together with the large amount of textual information available in document archives, has increased the relevance of information retrieval related tools. In this work we present an extension of the Gambal system for clustering and visualization of documents based on fuzzy clustering techniques. The tool allows to structure the set of documents in a hierarchical way (using a fuzzy ...
متن کاملFuzzy Clustering Method for Content-based Indexing
E cient and accurate information retrieval is one of the main issues in multimedia databases. In content-based multimedia retrieval databases, contents or features of the database objects are used for retrieval. To retrieve similar database objects, we often perform a nearest-neighbor search. A nearest-neighbor search is used to retrieve similar database objects with features nearest to the que...
متن کاملFuzzy C-Means Clustering for Biomedical Documents Using Ontology Based Indexing and Semantic Annotation
Search is the most obvious application of information retrieval. The variety of widely obtainable biomedical data is enormous and is expanding fast. This expansion makes the existing techniques are not enough to extract the most interesting patterns from the collection as per the user requirement. Recent researches are concentrating more on semantic based searching than the traditional term bas...
متن کاملUsing Natural Clusters Information to Build Fuzzy Indexing Structure
Efficient and accurate information retrieval is one of the main issues in multimedia databases. However, the key for this is how to build an efficient indexing structure. In this paper, we demonstrate how to use a fuzzy clustering algorithm, Sequential Fuzzy Competitive Clustering (SFCC), to get the natural clusters information from the data. Then use the information to build an efficient index...
متن کاملFuzzy Ontology and Information Access on the Web
Web is the largest available repository of data. In this contribution a solved application of Fuzzy set theory technique to the definition of flexible systems for locating and accessing information on the Web is presented. A purpose of our research is also a fact, that there are various ways to access the big amount of available and mostly unknown information for users. Clustering methods are a...
متن کامل